Improving Writer Identification Through Writer Selection
نویسندگان
چکیده
In this work we present a method for selecting instances for a writer identification system underpinned on the dissimilarity representation and a holistic representation based on texture. The proposed method is based on a genetic algorithm that surpasses the limitations imposed by large training sets by selecting writers instead of instances. To show the efficiency of the proposed method, we have performed experiments on three different databases (BFL, IAM, and Firemaker) where we can observe not only a reduction of about 50% in the number of writers necessary to build the dissimilarity model but also a gain in terms of identification rate. Comparing the writer selection with the traditional instance selection, we could observe that both strategies produce similar results but the former converges about three times faster.
منابع مشابه
Offline Language-free Writer Identification based on Speeded-up Robust Features
This article proposes offline language-free writer identification based on speeded-up robust features (SURF), goes through training, enrollment, and identification stages. In all stages, an isotropic Box filter is first used to segment the handwritten text image into word regions (WRs). Then, the SURF descriptors (SUDs) of word region and the corresponding scales and orientations (SOs) are extr...
متن کاملFeature Selection Methods for Writer Identification: A Comparative Study
Feature selection is an important area in the machine learning, specifically in pattern recognition. However, it has not received so many focuses in Writer Identification domain. Therefore, this paper is meant for exploring the usage of feature selection in this domain. Various filter and wrapper feature selection methods are selected and their performances are analyzed using image dataset from...
متن کاملA Survey on Writer Identification Schemes
This paper presents a survey of the literature on writer identification schemes and techniques up till date. The paper outlines an overview of the writer identification schemes mainly in Chinese, English, Arabic and Persian languages. Taxonomy of different features adopted for online and offline writer identification schemes is also drawn at. The feature extraction methods adopted for the schem...
متن کاملImproving Grapheme Codebook Selection for Scribe Identification
In this paper we test several approaches to analysing grapheme codebook features for offline writer identification in medieval English scribal manuscripts. Current methods for selecting a codebook typically produce codebooks that perform no better than random grapheme selection, so our aim in this analysis is to identify potential methods of improving codebook selection. Three feature extractio...
متن کاملAutomatic Writer Identification in Medieval Papal Charters
Automatic writer identification and writer verification has recently received significant attention in the field of historical analysis. In this work a short overview of current approaches for writer identification is given. Current state-of-the-art results on contemporary data are related to different approaches for writer verification on a small dataset of datum lines extracted from papal cha...
متن کامل